Inference on Unseen Protein Solubility Data

This mode lets users assess how the novel neural network, which is based on the most informative residue distribution, performs on unseen data.

  • Users can upload a CSV file of test Protein Solubility sequences.
  • Users can also enter a Protein Solubility sequence directly.
  • The input file must contain only Protein Solubility sequences.
  • Users must select one Protein Solubility species.
  • Once the processing command is successfully started, the exploratory data analysis engine processes the data and predicts a label for each sequence.
  • After processing finishes, users can download the result file by clicking the button provided.
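
Because the input file must contain only sequences, it can help to check a file locally before uploading it. The following is a minimal sketch, not the tool's own validation: the one-sequence-per-row layout without a header, the 20-letter standard amino-acid alphabet, and the function name `find_invalid_sequences` are all assumptions made for illustration, and the format the tool actually accepts may differ.

```python
import csv
import io

# Assumption: sequences use only the 20 standard amino-acid one-letter codes.
VALID_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")


def find_invalid_sequences(csv_text):
    """Return (row_number, raw_value) pairs for rows that are not plain sequences."""
    invalid = []
    reader = csv.reader(io.StringIO(csv_text))
    for row_number, row in enumerate(reader, start=1):
        # Assumption: exactly one sequence per row, in a single column.
        if len(row) != 1:
            invalid.append((row_number, ",".join(row)))
            continue
        sequence = row[0].strip().upper()
        # Reject empty cells and any character outside the assumed alphabet.
        if not sequence or not set(sequence) <= VALID_RESIDUES:
            invalid.append((row_number, row[0]))
    return invalid


sample = "MKTAYIAKQR\nMKT1YIAK\nGSHMLEDPVD\n"
print(find_invalid_sequences(sample))  # [(2, 'MKT1YIAK')]
```

An empty result means every row passed the assumed checks. To check a file on disk, read it with `open(path, newline="")` and pass its contents to the function.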

Training the Model from Scratch

  • Users need to provide a CSV file containing Protein Solubility data.
  • Users are free to choose the value of K for the K-tuple (K-mer).
  • Users are free to choose the data split method.
  • Users are free to choose the number of folds for the data split.
  • Users are free to choose the machine learning classifier.
  • Before starting the training process, users need to complete the following steps:
      • Sign up, preferably with an organizational email account, providing the required data and the purpose of the experiment.
      • After completing the sign-up process, wait for the account to be approved and for training permission to be granted.
      • If the request is approved, you will be able to log in for a single training run.
  • Once the processing command is successfully started, the exploratory model training engine processes the data and trains the model.
  • At the end of training, users can download performance-related artifacts to analyze the model's behavior.
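
To make the K-mer and fold options above more concrete, here is a minimal sketch of what they commonly mean. It is an illustration under stated assumptions, not the tool's implementation: it assumes overlapping K-mers with a stride of 1 and a plain K-fold split without shuffling or stratification. The function names `kmers` and `k_fold_indices` are hypothetical.

```python
def kmers(sequence, k):
    """Split a sequence into overlapping K-mers (assumed stride of 1)."""
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def k_fold_indices(n_samples, n_folds):
    """Yield (train_indices, test_indices) per fold, in order and unshuffled."""
    if not 2 <= n_folds <= n_samples:
        raise ValueError("n_folds must be between 2 and n_samples")
    base, extra = divmod(n_samples, n_folds)
    start = 0
    for fold in range(n_folds):
        # The first `extra` folds take one additional sample each.
        size = base + (1 if fold < extra else 0)
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size


print(kmers("MKTAY", 3))  # ['MKT', 'KTA', 'TAY']
for train, test in k_fold_indices(5, 2):
    print(train, test)
# [3, 4] [0, 1, 2]
# [0, 1, 2] [3, 4]
```

A larger K gives a bigger vocabulary of longer fragments, while more folds give each model more training data at the cost of more training runs.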